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Abstract. We prove that, for embeddings of a path of length n in t 2 , the 3-distortion 
is an f2(n 1//2 ), and that, when embedded in R d , the 3-distortion is an 

The general context of this paper is the study of the distortion that appears when a 
metric space is embedded into a Euclidean space. Such a study plays an important role in 
algorithmic geometry and its applications. In particular, significant memory gains can be 
achieved when a metric space is embedded into a low dimensional Euclidean space, and, 
therefore, the study of such embeddings is directly connected with the construction of effi- 
cient computer representations of (finite) metric spaces. The price to pay for such memory 
gains is the inevitable deformations that result from the embedding, and it is therefore 
quite important to control them, typically to understand their asymptotic behaviour when 
the size of the metric space increases. 

A standard parameter for controlling the deformation is the distortion, that takes into 
account pairs of points and compares their distances in the source and the target spaces — 
see precise definition below. The distortion is rather well understood, and, in particular, 
precise bounds for its values in the case of general finite metric spaces are known [2]. 

Now, other parameters may be associated with an embedding naturally. Typically, for 
each k, one can introduce the notion of a k- distortion by taking into account fc-tuples of 
points rather than just pairs, and measuring the way the volume of the associated polytope 
is changed. This is what U. Feige does in pQ in order to construct an algorithm minimizing 
the bandwidth of a graph, i.e., finding a numbering v\, v n of the vertices for which the 
supremum of \i — j\ over all pairs (i,j) such that (vi,Vj) is an edge is as small as possible. 
The idea of PQ is to consider volume-respecting embeddings of the graph into a Euclidean 
space. The point is to show that, among all projections of such an embedding on a line, 
a positive proportion has a minimal bandwidth of the expected size, and the main step is 
to investigate the fc-distorsion. 

Owing to the above applications and connections, understanding ^-distortion for every k 
seems to be a quite natural goal. Now, in contrast to the case k = 2, very little is known so 
far about fc-distortion for k > 3. The aim of this paper is to establish some results about 
3-distortion, in the most simple case of a metric space consisting of equidistant points on 
a line. So, we denote by U n the set {0, 1, n} equipped with the distance d(i,j) = — 
Then, for each d > 2, there exists a real parameter <53(n n ,IR rf ) > 1 that measures the 
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deformation of triangles when U n is embedded in R rf . The intuition is that, the bigger ^3, 
the flatter the triangles — the precise definition is given in Section 1 below. 

As R d isometrically embeds in R d+1 , the inequality 5 3 (IT n , M d+1 ) < <5 3 (II n ,R d ) imme- 
diately follows from the precise definition, implying in particular <$3(n n ,R d ) < <5 3 (n n ,R 2 ) 
for d > 3. The meaning is that, when we have more space, we can more easily embed 
with small distortion. For d = 2 (the planar case), hence for every d, it is easy to see 
that <53(n n ,R rf ) is at most linear in n, so the question is to compare #3(n n ,K d ) with the 
polynomial functions n a , < a < 1. What we do below is to prove one lower bound result 
for d = 2, and one upper bound result for d > 2: 

Proposition 1. The 3-distortion <53(Il n ,R 2 ) is an ^(n 1 / 2 ). 

Proposition 2. For each fixed d, the 3-distortion 5^(H n ,M. d ) is an (^(n 1 /^ 1 )). 

The results are likely not to be optimal: we conjecture that <5 3 (n n ,R 2 ) might be an 
n(n), and that # 3 (n n ,K d ) might be lower than polynomial, typically polylogarithmic, for 
d > 3. This would mean that the behaviour of the 3-distortion radically differs from the 
standard distortion which is polynomial in n for each dimension d. 



1. The 3-distortion 

Our first task is to make the allusive definitions of the introduction precise. 

For (V,p) a metric space and / a non-expanding (i.e., 1-Lipschitz) embedding of V 
into M. d , the distortion A(/) of / is defined to be the supremum of the compression ratio 
between the distance of two points in (V, p) and that of their images in R rf : 

By construction, A(/) is at least 1, and the larger it is, the bigger the deformation of 
distances caused by /. 

Let us turn to k = 3, i.e., let us consider images of triangles. In the denominator 
of (pQ), the length of the segment [f(P),f(Q)] is replaced with the area of the trian- 
gle [f(P),f(Q),f(R)]. As for the numerator, the area makes no sense in the source 
space (V, p), but we observe that, at least in good cases, p(P, Q) is the sup of the lengths 
Dist((7(P), g(Q)) for g a non-expanding embedding of V to R rf (provided d > 1). This 
naturally leads to defining p^(P,Q, R) to be the sup of Area([<?(P), g(Q), g(R)]) for g a 
non-expanding embedding of V to R rf (provided d > 2), and to defining the 3-distortion 
of / to be 

We shall be interested in the minimal possible value of A 3 (/), i.e., in the configurations 
that minimalize the distortion of triangles. We are thus led to the following notion: 

Definition. The 3-distortion 6s(V, M. d ) is defined to be the infimum of A 3 (/) over all 
non-expanding embeddings / of V into R rf . 
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Figure 1. The 3-distortion of a non-expanding embedding / : II2 — > M 2 : 
(i) generic case: the area is absm(9)/2, whence ^(f) = l/absin6, (ii) an 
optimal case: A3(/) = 1; (in) a worst case: the isometrical embedding; then 
the image of / is a flat triangle of area 0, hence A%(f) = 00. 

The definition for A;-tuples would be similar, with volume replacing area. 

Figure Q] describes the situation for the graph II2. In this (very simple) case, there exist 
embeddings with 3-distortion equal to 1, namely the ones of Figure [l](zz) , and, therefore, 
we find 83(112) = 1- 

In the general case, we always have A3(/) > 1 by construction, and, the flatter the 
triangles in the image of /, the larger A3(/). For instance, when / is an isometrical 
embedding of Tl n in R rf , all triangles are flat, as in Figure QJm), and the distortion A3(/) 
is infinite. Thus the 3-distortion is a measure of the inevitable flattening of triangles that 
occurs when a (large) metric space is embedded in some fixed Euclidean space: then, it 
is impossible that all triples of vertices are embedded so as to form a rectangular triangle 
as in Figure HKii), an d the question is to evaluate how far from that one must lie. The 
reader can check that, even in the case of embeddings of II3 into M 2 , it is not so easy to 
prove that the minimal 3-distortion is 2/\/3 = 1, 1547..., corresponding to a U-shape with 
length 1 edges and 2ir/3 angles, and obtaining an exact value in the general case of II n 
seems out of reach. This contributes to making asymptotic bounds desirable. 

In the specific case of the space II n , i.e., of n equidistant points at distance 1 on the 
real line, the definition of 3-distortion can be given a more simple form. Indeed, if g is a 
non-expanding embedding of II n into M. d , we have Dist(g(i), g(j)) < \i — j\ and therefore, 
fori <j< k, we find Area([g(i), g(j), g{k)]) < (j — i)(k— j)/2; on the other hand, provided 
d > 2, we can always find g such that the latter inequality is an equality as in Figure QJn). 
Hence, for < i < j < k < n, we have 

P3 (i,j,k) = (j-i)(k-j)/2. 

So, for / a non-expanding embedding of II,,, into M. d , (T2|) takes the form 

< 3 > A ^-"K A^w'm'/w]) ;0£i<J< ^"}- 

In the sequel, we shall forget about embeddings and only work inside the target space M. d . 

Definition. A finite sequence of points (Mo, . . . , M n ) in M d is said to be tame if, for each i, 
we have Dist(Mj, M i+ i) < 1. In this case, we put 

(4) A 3 (M , . . . ,M„) = SUP { J^M^l^; < i < i < k < „}. 
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If / is an embedding of Tl n into R , then the sequence (/(0), . . . , f{n)) is tame, and, 
conversely, each tame sequence determines a unique embedding. Now, translating (|3|) 
gives (j4]) for Mj = f(i) and the notation is consistent. Then the 3-distortion of U n can be 
expressed in terms of tame sequences of points: for all n, d, we have 

(5) 5 3 (U n , R d ) = inf{A 3 (M , . . . , M n ); (M , . . . , M n ) a tame sequence in R d }. 

Thus, from now on, our aim is to study the possible values of the quantity ^3(n n ,]R rf ) 
of ©. 

2. A LOWER BOUND IN THE PLANAR CASE 

In order to prove Proposition [H we shall consider an arbitrary tame sequence in M 2 , 
and prove that some triangle is much distorted, i.e., flattened. To this end we observe 
that points in convex position provide a triangle with large 3-distortion. 

Say that a sequence (Po, . . . , P m -i) of points in the plane is convex if the boundary of 
the convex hull of {Po, . . . , P m -i} is exactly the polygon with vertices Po, . . . ,P m -i in 
this order. 

Lemma 3. Assume that (Pq, . . . , P m -i) is a convex sequence with m > 3. Then there 
exists i such that the 3-distortion of the triangle — where indices are taken 

modulo m — is at least m/(2ir). 

Proof. The sum of angles ZP ^i^2 + £P\PiPz + ■■■ + ZP m _iP Q Pi is (m - 2)2vr. As all 
angles are positive and less than n, one of them is at least ^^-ir. The 3-distortion of the 
corresponding triangle is then at least m/(2ii). □ 




Figure 2. Convex sequence of points 



Lemma 4. Assume that (Mo, . . . , M n ) is a tame sequence in M 2 ; and that 5 is an integer 
greater than or equal to A3 (Mo, . . . , M n ). Then the sequence (Mo, M$, M 2 s, ■ ■ ■ , Mi « u) is 
convex. 

Proof. Let 5 := A 3 (M , . . . , M n ). For all % < j, we have Dist(M i; Mj) < \j - i\. Since for 
k > j the area of the triangle [Mj, Mj, M^] is at least ; hence a fortiori ' 

the distance between the point Mfe and the line (M$Mj) is at least ^p- (FigureE]). There- 
fore, for k > j + 5, the points M& and M^+i lie on the same side of the line (MjMj): 
otherwise, the distance between M& and M^+i would be at least 2^-^-, contrary to the 
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Figure 3. Minimal distance from the point M k to the line (MjMj) 

tameness hypothesis. Hence, for k > j + 5, the point M k lies on the same side of the line 
(MiMj) as Mj +S . 

For a contradiction, assume that, for some i, the sequence (M^, Mu + ug, M( i+2 )$, 
M(j + 3) ( 5) is not convex. Then either the four points are not in convex position, or they are 
in convex position but they do not appear in the right order on the border of their convex 
hull. 

In the first case (Figure E]), one point lies in the convex hull of the three others. But 
this contradicts the hypothesis that adjacent points lie on the same side of each line 
(M jS M (j+1)s ). 

In the second case (Figure [5]), the points are in convex position, but the segment 
[M(j +1 ),5, M( i+2 )a] crosses the line (Mi$M( i+3 ^). Then there exists j with (i + 1)5 < 
j < (i + 2)5 such that the distance from Mj to (MisM^ i+3 ^) is at most 1/2. The area of 
the triangle [M^, Mj, Mr i+S \g] is therefore at most 35 /4. On the other side, by definition 
of <5o, this area is at least ((i + 3)5 — j)(j — i5)/25o, hence a fortiori ((i + 3)5 — j)(j —i5)/25. 
Since {i + 1)5 < j < (i + 2)5, the latter quantity is at least 5, a contradiction. □ 




Figure 4. Four points not in convex position: a problem arises between 

{i + 2)5 and (* + 3)<5 

Proof of Proposition 1. Let (Mo, . . . , M n ) be a tame sequence in R 2 , and let 5 be [A3(Mo, . . . , M n )~\ . 
If we have [_tJ < 2, then we have 5 > n/2, hence 5 G ^(n 1 / 2 ) a fortiori. Assume now 
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(t + l)«5 




i + 3)5 



(i + 2)6 



•problem). 

Figure 5. Four points not in ordered convex position: a problem arises be- 
tween {% + 1)5 and (i + 2)5 



lj\ > 2. Then by LemmaHl the sequence (Mo, Ms, ■ ■ ■ , Mi au) is convex, and by LemmaO 
there is a triangle whose distortion is least [_t J/2vr. By definition, this quantity is at most 
8, hence we have 5 £ ^(f )• So in any case, 5 3 (n n ,M 2 ) lies in Q(n 1 / 2 ). □ 

Remark. The proof of Lemma H] gives many constraints for the sequence (Mo, . . . , M n ). 
Here we use these constraints to construct a convex subsequence of size y/n, but it is 
likely that larger subsequences with properties slightly weaker than convexity could be 
constructed as well. So we think that the result of Proposition [I] is not optimal. 

3. Construction of a ^-dimensional embedding 

Now we turn to dimension d and we wish to establish the lower bound result stated as 
Proposition [2j Our aim is to construct for each n a tame sequence of length n in M. with 
a small 3-distortion, i.e., such that all extracted triangles are not too much flattened. 

A natural idea would be to construct the nth sequence (Mo, n , • • • , M niW ) by taking more 
and more points on a single curve T of length 1, and rescaling. But then a small 3- 
distortion would require a complicated curve T. Indeed, assume that T is an immersion 
of class C 2 . As r is compact, the infimum rp of the radii of the osculating circles of V is 
reached at some point, and therefore it is non-zero. For any n, there exists % such that the 
curvilinear distance between Mi n and Mj + 2, n is lower than 2/n before rescaling. Then the 
distances between Mj n and Mj + i jn , and between Mj+i n and Mi + 2,n are lower than 2/n 
too. Therefore the sine of the angle between the lines (Mi n Mj_|_i n ) and (Mj_|_i in Mj_|_2, n ) ^ s 
at most r-p/n, and the distortion of the triangle Mj )n Mj_|_i jn Mj_|_2 in is at least n/r-p. This 
leads to a 3-distortion in Jl(n) for (Mo in , . . . ,M n ^ n ). So, in order to construct sequences 
of points with small 3-distortion, we have either to use curves depending on n, or to use a 
non-C 2 curve (typically a fractal curve). In the following construction we choose the first 
option. 

Proof of Proposition 2. For simplicity, we assume n = m^ 1 for some m. We recursively 
construct a family of curves T m ^ in and, on each of them, we mark m d ~ l + 1 points 
-Pm,d,o, • • • , P m ,d, m d-i in such a way that A 3 (P mA0 , . . .,P mAm d-i) lies in 0(m) for each 
fixed d. 

When m + 1 points lie at mutual distance 1 on an arc of circle, the 3-distortion is in 
0(m). The idea of our construction is to use this fact and to recursively put circles one 
above the others. 

Let To be the sixth of a circle whose radius r will be chosen later. On To we put points 
Po, . . . , P m with regular angular distance Then we replace the arc between Pi and Pi+\ 
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with a coplanar arc of radius 2r lying between the original arc and the chord connecting 
Pi to Pi+i. We rescale the figure so that the curvilinear coordinate of Pi becomes i for 
each i. We let T mt 2 be the resulting curve (oriented from Pq to P m ) and P m ,2fl, ■ ■ ■ , Pm,2,m 
be the marked points on F m> 2- 




Figure 6. On the right: the curve T m ^ and the points P m ,2,o, ■ ■ ■ Pm,2,m- 
On the left: three points A,B,C with at least one P m ,2,i between them yield 
an angle ZABC < 7r(l — ^) 



The main remark for the proof is that, for all triples A, B, C taken in increasing order 
on T m ^2 (not necessarily some Pm 2 i 

's) and not all lying on some arc (P m ,2,iPm,2,i+i)i we 
have Z ABC < 7r(l — g^). By construction, the Euclidean distance between two points of 
T mi 2 is at least 3/tt times their curvilinear distance, and therefore the 3-distortion of the 
triangle ABC is in O(m). 

The idea for the induction is to add a copy of T rrit 2 between P m ,d-\,i and P m ,d-i,i+ii 
orthogonally to the hyperplane in which T m ^-i lies. More precisely, we construct T m ^ 
and P mA0 , • • • , P m ,d, m d-i horn T mid _i and P m ,d-ifl, P m ,d-\,m^ so that tne following 
induction hypothesis is preserved: 

(i) ^m,d is a curve of length m d ~ x in R d such that two points at curvilinear distance i 
lie at euclidian distance at least (2/\/3) _d+2 7r/3 x £; 
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(ii) If A, B, C are three points that do not all lie on some arc (P m ,d,iPm,d,i+i) for any i, 
then the 3-distortion of the triangle [A, B,C] is at most c^m, where c& = (2/\/3)~ d+2 xG/ir. 
The induction hypothesis holds for d = 2. 

The construction of Y m ^ is as follows. We identify R d with R d_1 x R, where R d_1 is 
the space containing Y m ^-i- Next we work in the cylinder Z m ^-\ defined by Y m ^-i x R + 
with the induced metric. Note that this cylinder Z m ^-\ is orthogonal to the hyperplane 
containing Y m ^-i- For each i between and m d ~ 2 — 1, we insert in Z m ^-\ a rescaled copy 
of r mi 2 from P rn ,d-i,i to P m ,d-i,i+i- I n this way, we obtain a curve on which m^ 1 + 1 
points are marked: the P m ,d-i,iS from Y m ^-i plus m d ~ 2 x (m — 1) new points between 
fm.d-i.i and P mjd -i,i+i for « = 0, . . . , m d ~ 2 - 1. We denote them by P m ,d,o, P m ,d,m d -^ 
according to the linear ordering. We then rescale the figure so that the curvilinear distance 
between consecutive points P m ,d,iS is 1- We call Y m ^ the resulting curve. 




Figure 7. The curve r m 3 in the space 

It remains to show that the induction hypothesis is preserved. 

For (i), we observe that the angle between any chord of Y m ^ and the hyperplane con- 
taining Y„ h d~\ is lower than ir/6. Therefore, when going from Y mj d-i to Y m> d, no distance 
is decreased by more than a factor 2 / 

For (ii), let A, B, C be three points on Y m ^ and let i be such that A lies before P m ,d,i 
and C lies after P m ,d,i according to the fixed curvilinear ordering. 

First case: There exists j such that A,B,C lie between P mt d,jm and Pm,d,(j+i)m- This 
means that A, B, C lie on some copy of Y m ^ in Z m ^-\ inserted in the last step of the 
inductive construction. In the case of Y m ^, we know that the 3-distortion is at most c^vn. 
Here there is an additional 3-distortion due to the fact that the copy was made on the 
cylinder Z m ^-\- The projection of Y m ^ on R d_1 is Y m ^-i, and not a line as in the d = 2 
case. By induction hypothesis, the distances on Y m d-i (compared with the Euclidean 
distances) are not contracted by more than (2/v / 3) _d+1 vr/3, hence the distortion of the 
triangle [A, B,C] is bounded by (2/v / 3) _d+1 vr/3 x C2m < Cdjn. 

Second case: There exists j such that A lies before Pm,d,jm and C lies after P m ,d,jm- 
Then, when A, B,C are projected from Y m ^ on Y m ^-i along Z m ^-\, the area of the 
triangle [A, B,C] decreases by a multiplicative factor at most \/3/2. By the induction 
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hypothesis the projection of the triangle has 3-distortion at most Cd-im, therefore the 
original triangle [A, B, C] has 3-distortion at most c^m. □ 

Remarks, (i) The choice of the curve T m ^ may look strange, in particular the choice of 
an arc of radius 2r between P m ,i and P m ,i+i rather than an arc of radius r or a chord. 
The reason is that, in both cases, the key property, namely that the triangle [A, B, C] has 
3-distortion 0(m) if A, B, C do not all lie on some arc (Pd,m,iPd,m,i+i) , fails. With arcs of 
radius r, if we take A, B, C close to some Pd, m ,i, then the 3-distortion of [A, B, C] can be 
arbitrary large. With chords, if we take A, B strictly between Pd, m ,i an d Pd,m,i+i and C 
just after Pd, m ,i+i, then the 3-distortion is not bounded either. 

(ii) Our construction uses d — 1 pairwise orthogonal directions to draw the curves Y m ^ 
one above the other. We could use other fixed directions as well, the point being that the 
projections preserve the convexity of the specific patterns we consider. Alternatively we 
could replace cylinders by cones, as central projection also preserves the needed convexity. 
But it seems difficult to use more than one cylinder, and therefore more than one curve, 
for each new dimension, because no projection preserves the needed convexity for several 
sufficiently distinct directions simultaneously. 
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